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Abstract 

Germline inactivating variants in BRCAl lead to a significantly increased risk of breast and ovarian cancers in carriers. While 
the functional effect of many variants can be inferred from the DNA sequence, determining the effect of missense variants 
present a significant challenge. A series of biochemical and cell biological assays have been successfully used to explore the 
impact of these variants on the function of BRCAl, which contribute to assessing their likelihood of pathogenicity. It has 
been determined that variants that co-localize with structural or functional motifs are more likely to disrupt the stability and 
function of BRCAl . Here we assess the functional impact of 37 variants chosen to probe the functional impact of variants in 
phosphorylation sites and in the BRCT domains. In addition, we perform a meta-analysis of 170 unique variants tested by 
the transcription activation assays in the carboxy-terminal domain of BRCAl using a recently developed computation model 
to provide assessment for functional impact and their likelihood of pathogenicity. 
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Introduction 

Inherited BRCAl inactivating mutations are major determinants 
of breast and ovarian cancer risk, accounting for 46-68% of cases 
with a family history of breast cancer cases [1,2,3,4]. Since 1996 
genetic testing to identify mutations in BRCAl and BRCA2 has 
been offered to women with a family history of breast and ovarian 
cancer [5,6] . Presently several different assay platforms are used to 
investigate BRCA alterations, including amplicon-based Sanger 
sequencing, target capture followed by next-generation sequenc- 
ing, and methods to detect large genomic rearrangements [5,7,8]. 
Variations found during sequencing include nonsense, frameshift, 
missense, splicing, and small insertions and deletions. Variants in 
BRCAl that lead to functional inactivation, either by compromis- 
ing gene expression, correct splicing, or protein structure and 
stability are associated with an increased risk for cancer [9]. In 
many instances, inactivation can be inferred from the DNA 
sequence alone [e.g. nonsense or frameshift changes). However, in 
cases such as missense or splicing variants the resulting impact on 
function cannot be directiy inferred. While many variants have 
been evaluated using functional assays and multifactorial statistical 
models [10,11], cancer association has not been determined for 
several variants, referred to as Variants of Uncertain Clinical 
Significance (VUS). 

An array of functional tests and computation prediction tools 
have been developed to aid in the determination of the functional 



impact of sequence variants of BRCAl, in particular, in vitro assays 
that assess the integrity and functionality of the N-terminal RING 
finger and the C-terminal BRCT tandem domains (tBRCT) of 
BRCA 1 [11,12]. Variants in these domains are more likely to have 
a functional impact [13,14]. Analysis of these variants fulfills a 
double purpose: they provide information to aid in the classifica- 
tion of variants, and inform the biology of BRCA 1 by pinpointing 
specific regions on the protein critical for different biochemical 
activities. 

In this report we conduct an analysis of a large series of variants 
located in the carboxy-terminal domain of BRCAl with a focus on 
a critical structural feature that is thought to stabilize the tandem 
BRCT domains and phosphorylation motifs. We used the 
transcription activation (TA) assay to analyze a total of 37 
variants. These include 24 naturally-occurring VUS and 13 
artificial variants to comprehensively probe phosphorylation sites 
and explore salt-bridge interactions present in the tandem BRCT, 
connecting the arginine residue at position 1 699 and the glutamic 
acid residue at position 1836 [15,16]. The TA assay has been 
extensively validated showing 100% sensitivity (0.73 to 1.0; 
95%CI) and 88.9% specificity (0.52 to 0.99; 95%CI) using a 
reference dataset of variants classified by multifactorial models 
[17]. Finally, we conduct a combined meta-analysis of published 
transcription-based assays using a Bayesian statistical model, called 
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VarCall [18], to assess the likelihood of pathogenicity given their 
functional impact. 

Materials and Methods 

Rationale for Choice of Variants 

In total we analyzed thirty seven BRCAl missense variants 
(Table 1, Figure 1). These variants represent three distinct groups: 
variants of uncertain significance in BRCAl, phosphorylation site 
variants, and salt-bridge variants in the BRCT domains. With the 
exception of R1699W, no other variant was found in the NHBLI 
Exome Sequencing Project (data release ESP6500 SI-V2). 

Variants of uncertain significance. Five variants 
(M1411T, S1457A, S1524A, I1529V and S1542A) were chosen 
to increase coverage of the unstructured region (aa 1396-1648) of 
BRCAl C-terminus (aa 1396-1863, corresponding to exons 13 to 
24 coding sequence, encompassing the two BRCT tandem 
domains) [19]. Three of these naturally-occurring variants 
(S1457A, S1524A and S1542A) are located in BRCAl phosphor- 
ylation sites. Several phosphorylation sites have been identified on 
the C-terminal of BRCAl that are involved in biological functions 
such as cell cycle checkpoint and caspase activation [20]. We also 
generated artificial (cannot result from a single nucleotide change 



in the natural codon) alanine variants to probe the role of 
phosphorylation at position S1423 and S1466 to be consistent with 
the other phosphorylation sites. Alanine was choosen to substitute 
serine residues in order to impede phosphorylation. The 
remaining 13 variants (E1660V, L1679Q, C1697F, E1731D, 
D1733Y, G1748D, A1752T, E1754V, E1765K, S1790Y, 
H1805L, D1813H and L1844P) are missense variants in the 
tBRCT domains (Table 1, Figure 1). 

BRCT salt-bridge variants. In previous studies we identi- 
fied and analyzed a naturally occurring BRCAl allele, R1699W, 
which showed a temperature-sensitive behavior [16,21]. R1699 
and El 836 are critical residues for a salt-bridge that stabilizes the 
tBRCT [15,16]. We reasoned that variants in residues involved in 
the salt-bridge could generate useful structure-function informa- 
tion and potential temperature-sensitive proteins [21]. Thus, we 
generated a panel of eight variants of residue 1699 and three 
variants of residue 1836. We also generated four double mutants 
combining changes in residues R1699 and El 836 (Table 1, 
Figure 1). 
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Figure 1. BRCA1 carboxy-terminal variants. (A) Natural and artificial (underlined) BRCAl variants in the context of the analyzed region 
(comprising amino acids residues 1396-1863). (B) BRCAl R1699 and El 836 variants in the context of the analyzed region (comprising amino acids 
residues 1560-1863). Structural models were obtained with Mupit tool (http://mupit.icm.jhu.edu/) using the Ijnx PDB structure (amino acid residues 
R1699 and El 936 are depicted in green). 
doi:1 0.1 371/journal.pone.0097766.g001 
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Table 1. BRCAl variants analyzed in this study. 



Exon 


Protein Variant (HGVS) 


DNA variant (HGVS) 


DNA variant (BiC) 


Not6S 


13 


p.Met141 IThr 


c.4232T>C 


T4351C 


VUS; Coiled-coil domain 


13 


p.Ser1423Ala 


C.42680A 


G4386A 


Disordered region; Phosphorylation site; artificial variant 


14 


p.Serl457Ala 


c.4369T>G 


T4488G 


Disordered region; Phosphorylation site 


14 


p.Ser1466Ala 


C.43980A 


C451 7 A 


Disordered region; Phosphorylation site; artificial variant 


15 


p.Ser1524Ala 


c.4570T>G 


T4689G 


Disordered region; Phosphorylation site 


15 


p.lle1529Val 


c.4585A>G 


A4704G 


Disordered region 


15 


p.Ser1542Ala 


c.4624T>G 


T4743G 


Disordered region; Phosphorylation site 


16 


p.Glu1660Val 


c.4979A>T 


A5098T 


BRCT; 


17 


p.Leu1679Gln 


c.5036T>A 


T5155A 


BRCT 


18 


p.Cysl697Phe 


C.5090OT 


G5209T 


BRCT 


19 


p.Glu1731 Asp 


C.51930C 


G5312C 


BRCT 


20 


p.Aspl733Tyr 


C.51970T 


G5316T 


BRCT 


20 


p.Glyl748Asp 


C.5243G>A 


G5362A 


BRCT 


20 


pAla1752Thr 


C.52540A 


G5373A 


BRCT 


20 


p.Glu1754Val 


C.5261 A>T 


A5380T 


BRCT 


21 


p.Glu1765Lys 


C.52930A 


G5412A 


BRCT 


22 


p.Ser1790Tyr 


C.53690A 


C5488A 


BRCT 


23 


p.His1805Leu 


c.5414A>T 


A5533T 


BRCT 


23 


pAspl813His 


C.54370C 


G5556C 


BRCT 


24 


p.Leu1844Pro 


c.5531T>C 


T565CC 


BRCT 


18 


pArg1699Gly 


C.5095OG 


5214 


Salt bridge variants 


18 


pArg1699Leu 


C.5096OT 


5215 


Salt bridge variants 


18 


pArg1699Gln 


C.5096OA 


5215 


Salt bridge variants 


18 


pArg1699Glu 


N/A 


N/A 


Salt bridge variants 


18 


p. Argl699Phe 


N/A 


N/A 


Salt bridge variants 


18 


p. Arg1699His 


N/A 


N/A 


Salt bridge variants 


18 


pArg1699lle 


N/A 


N/A 


Salt bridge variants 


18 


p.Arg1699Lys 


N/A 


N/A 


Salt bridge variants 


24 


p.Glu1836Ly 


C.5506OA 


5625 


Salt bridge variants 


24 


p.Glu1836Gly 


c.5507A>G 


5626 


Salt bridge variants 


24 


p.Glu1836Asp 


C.5508OC 


5627 


Salt bridge variants 


18/24 


p.Arg1699Lys 






Salt bridge variants (double mutants) 


p.Glu1836Lys 


18/24 


p. Arg1699Lys 






Salt bridge variants (double mutants) 


p.Glu1836Gly 


18/24 


p. Arg1699Lys 






Salt bridge variants (double mutants) 


p.Glu1836Asp 


18/24 


pArg1699Glu 






Salt bridge variants (double mutants) 


p.Glu1836Lys 


18/24 


pArg1699Glu 






Salt bridge variants (double mutants) 


p.Glu1836Gly 


18/24 


pArg1699Glu 






Salt bridge variants (double mutants) 


p.Glu1836Asp 



Non-naturally occurring variants are underlined. 
doi:l 0.1 371 /journal.pone.0097766.t001 



Constructs 

BRCAl expression constructs (controls and variants) were 
generated in 2 different construct contexts: for VUS and 
phosphorylation sites variants analysis we used the region 
comprising amino acid residues 1396-1863, corresponding to 



exon 13-24 coding sequence (Figure lA); for salt-bridge variants 
analysis we used the region comprising amino acid residues 1560- 
1863, corresponding to exon 16-24 coding sequence (Figure IB). 
AU variants were confirmed by sequencing. 
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VUS and phosphorylation sites variants. Control con- 
structs (amino acid context 1396-1863) containing the wt BRCAl, 
S1613G, M1775R, and Y1853X were generated previously 

(Figure lA) [22]. Mutations in the cDNA sequence were 
introduced by site-directed mutagenesis using plasmid p385- 
BRCAl as template, as previously described [23]. Primers 
sequences are available upon request. For each variant, both 
products (5' and 3' regions) were combined and used as a template 
for a final round of PGR using 24ENDT and UXl 3 primers [23]. 
The final PGR products were then digested with Barnlil and 
EcoK.\ and ligated to pGBT9 vector. To obtain the heterologous 
GAL4 DNA binding domain (GAL4-DBD) fusions in a mamma- 
lian expression vector, pGTBQ constructs were digested with 
Hindlll and Bamlil, a 1.8 Kb band was isolated and ligated into 
equally digested pGDNA3 vector. 

BRCT salt-bridge variants. Control constructs (amino acid 
context 1560-1863, Figure IB) containing the wt BRCAl, 
M1775R, and Y1853X were previously described [16,24]. 
Mutations in the cDNA sequence were introduced as described 
above. To obtain the double mutants the same procedure and 
primers described were used but instead of wild-type cDNA as 
templates we used the individual constructs containing the single 
mutants at position 1699. The final PCR products were then 
digested and cloned in pGBT9 vector, then subcloned in 
pCDNAS as described above. 

Transcriptional Assay 

The transcriptional assays were performed in mammalian cells 
as described [12,19,22]. Briefly, we used pG5Luc as a reporter and 
transfections were normalized with an internal control phGR-TK 
(Promega), which contains a Renilla luciferase gene under a 
constitutive TK basal promoter. Transfections were conducted in 
human HEK293T, HCC1937 [2.5] or SUM1315 cells in triplicate 
using Fugene 6 (Roche), harvested 24 h post-transfection, and 
luciferase activity was measured using the Dual Luciferase 
Reporter Assay System (Promega). Results were plotted as a 
percentage of the wild-type activity. We inferred disease relevance 
for each missense variant using a computational tool, VarGall, 
based on a Bayesian hierarchical model. [18]. 

VarGall is a tool that uses functional data, quantitative or 
categorical, as input and accounts for sources of experimental 
heterogeneity. Specifically, here we used non-normalized ratios of 
Firefly luciferase/ReniUa luciferase indexed by batch as the 
specific input data and each batch also records the ratio for the 
wild type control. VarCall generates a likelihood of pathogenicity 
for each variant given the input data in the form of a posterior 
probability of being damaging. In addition, VarCall also generates 
a Bayesian integrated likelihood ratio statistic that measures the 
degree to which the data support the hypothesis that a variant is 
protein damaging. To generate accurate probabilities VarCall uses 
large datasets and re-assesses previously anazide variants given the 
new data. Therefore, we included data for all (n = 176) variants 
previously tested under controlled conditions (see Results for 
further details). We used this model to infer disease relevance for 
all the naturally occurring variants described in this study, an 
additional set of 1 19 variants previously assayed for transcriptional 
activation [17] and provide a reanalysis of the 82 variants 
previously assessed by this model [18]. This combined set includes 
2,436 measurements of transcriptional activity for 176 unique 
variants in the region analyzed in the assay: amino acid residues 
context 1396-1863 (Tables SI and S2). Single nucleotide changes 
in this region can generate 2,740 unique variants (1,244 in the 
tBRGT). Of those, 219 have been documented in the population 
(BIG Database as of this writing) with 1 29 unique variants in the 



tBRGT domains. The present dataset represents 61% and 89.9% 
coverage of aU variants documented in the amino acid region 
limited by residues 1398-1863 and the tBRGT domains, 

respectively. VarGall can also assess results from diflerent 
construct contexts (aa 1396-1863 or aa 1560-1863) because each 
variant is assessed relative to the wild type in the same context. We 
have extensively tested several contexts for the TA assay including 
aa 1560-1863 [26], aa 1396-1863 [22], and aa 1646-1859 [17]. 
While they show diflerent absolute activities the relative activities 
of variants (for example M1775R) are comparable in terms of the 
percentage of activity of the corresponding wild type. Thus, data 
from diflerent contexts can be directiy compared using VarGall 
since each batch is compared against its own wild-type construct. 

Results 

BRCAl C-terminus VUS 

In order to increase the coverage of variants located at carboxy- 
terminus of BRCAl we selected a set of 18 naturally occuring 
missense variants (Table 1). Six are located outside the tandem 
BRCT (tBRCT) domains: M1411T, located in the coiled-coU 
region; S1457A, S1524A, 1 1529V, and S1542A, located in tiie 
unstructured region of BRCA; and L1844P located in the 
carboxy- terminal tail of the molecule. Twelve are located in 
tBRCT domains: E1660V, L1679Q, C1697F, E1731D, D1733Y, 
G1748D, A1752T, E1754V, E1765K, S1790Y, H1805L and 
D1813H (Figure lA). To comprehensively probe the role of the 
phosphorylation sites we also included two artificial variants 
located outside the tBRGT that are ATM phosphorylation target 
sites: S1423A and S1466A [20] (Table 1, Figure lA). AU variants 
are located at amino acid residues strongly conserved in the 
vertebrate lineage (Figure 2). 

Among the naturally occuring variants, I1529V, E1660V, 
L1679Q, D1733Y, E1754V, E1765K, S1790Y, H1805L and 
L1844P showed transcriptional activity values corresponding to 
the wild-type reference (Figure 3A) indicating that they have no 
detectable functional impact. Similarly, the naturally-occurring 
phosphorylation variants S1457A, S1524A, and S1542A, and the 
artificial phosphorylation site variants, SI 42 3 A, S1466A, also 
displayed transcriptional activity >75% of the wild-type activity 
(Figure 3B) suggesting that phosphorylation at these sites is not 
required for the transcriptional activity of BRGAl in normal 
growing conditions. 

The remaining six variants exhibited reduced functional 
activity. Variants M1411T, C1697F, G1748D and A1752T 
showed significantly reduced (<30% of wt) levels of activity 
(Figure 3A). Two variants, E1731D and D1813H presented activty 
levels around 50% of the wild type reference (55±10% and 
55± 1 1 % respectively, Figure 3A) suggesting that they might have 
a mild to moderate impact on the function of BRCAl. 

BRCT Salt-bridge Variants 

The structural integrity of BRGAl tBRCT is required for 
transcriptional activation activity and variants that disrupt its 
integrity are strongly correlated to cancer risk [17]. Previously we 
identified a naturally occurring BRCAl allele, R1699W, that 

displayed impaired transcriptional activity in standard conditions 
(37°G), but showed normal activity when cells are shifted to 30°C 
indicating that the protein is able to restore proper folding at lower 
temperature [21]. Further analysis also revealed a significant 
decrease in peptide binding sensitivity/specificity in comparison to 
wild-type [17]. Interestingly, the arginine residue at position 1699 
is involved in a salt bridge with a glutamic acid and an aspartic 
acid at positions 1836 and 1840, respectively [15]. In order to 
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Figure 2. Phylogenetic conservation of amino acid residues in tlie surround region of BRCA1 analyzed variants. A multiple sequence 
alignment demonstrating amino acids residues evolutionary conservation are shown from H.sapiens (NP_009225.1), P.troglodytes (NP_00 1038958.1), 
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M. mulatto (NP_001 108421 .1), C.lupus (NP_001 01 3434.1), B.taurus (NP_848668.1), M. musculus (NP_033894.3), R. norvegicus (NP_036646.1) and G. 
gallus (NP_989500.1). Target amino acids residues are depicted in light grey and salt-bridge involved residues in dark grey. 
doi:1 0.1 371 /journal.pone.0097766.g002 



dissect the role of these residues in this molecular interaction we 
generated a series of BRC A 1 tBRCT single amino acid variants at 
positions 1699 and 1836 (Figiare IB). 

We investigated these variants using the transcription activation 
assay in three different temperatures to assess temperature- 
sensitive behaviors (Figures 4A and 4B). Variants R1699I, 
R1699F, R1699E, R1699G, R1699Q,and R1699H showed low 
activity across different temperatures. Interestingly, variant 
R1699L displayed an activity comparable to the wild type 
reference consistent across different temperatures (Figure 4A). 
One variant, a conservative change from an arginine residue to a 
lysine, R1699K, displayed a temperature dependent behavior, 
presenting transcriptional activation ~80% of wt at 30°C and 
progressively decreasing with temperature. At 37°C this variant 
showed a significantiy reduced activity (38% of wt. Figure 4B). 
This behavior was consistent in two different cell lines, HEK293T 
and HOC 1937. 

We also tested three variants at position 1836, where the 
original glutamic acid residue was replaced by an aspartic acid, a 
glycine or a lysine. E1836D exhibited a somewhat reduced 
transcriptional activation values that were comparable at 33°C 
and 37°C (Figure 4C). Variants E1836G and E1836K, on the 
other hand exhibited a temperature-sensitive behavior (Figure 4C). 

Then, we combined the R1699K (temperature-sensitive) or the 
R1699E (not temperature-sensitive) with variations at the 1836 site 
and tested the double mutants at 33°C and 37°C. All R1699K 
double mutants retained the temperature-sensitive behavior but 
only the conservative E1836D mutations retained normal activity 
at the permissive temperature (33°C) highlighting the role of both 
residues (1699 and 1836) on the stability of the BRCT domains 
(Figure 4D). Moreover, changes in one residue can compensate for 
changes in the other as demonstrated by the R1699E double 



mutants, which show levels of activity comparable to wt at the 
permissive temperature but dramatically reduced levels when cells 
are shifted to 37°C. In particular, the R1699E/E1836G showed 
the largest difference of activity between the two temperatures and 
might constitute a useful tool to probe the function of BRCAl 
(Figure 3E). 

VarCall 

Next, we evaluated the activity of missense variants using the 
VarCaU computational model [18]. Results from this model reflect 
the likelihood of pathogenicity given the functional impact data. 
VarCall results are depicted in Figure 5 where each variant's 
activity is represented by a boxplot summarizing the marginal 
posterior distribution of its random effect. A point estimate of the 
mixture model is plotted on the right margin. Its top component 
corresponds to variants with no functional impact, whereas its 
bottom component corresponds to variants with functional impact. 
VarCall data indicate that C1697F, G1748D and A1752T have 
significant impact on function and are likely to represent 
pathogenic variants (Figure 5). Ml 41 IT variant also showed 
reduced activity (Figure 5), resulting in a reduced but still 
significant probability of being pathogenic (0.35, Table SI). 

Discussion 

A significant percentage of genetic tests conducted for breast 
and ovarian cancer susceptibility results in findings of a variant of 
uncertain clinical significance (VUS). While some VUS are located 
in intronic or other putative regulatory regions, many are missense 
variants. These individual VUS alleles usually are very rare in the 
population precluding family-based or population-based genetic 
analysis to determine their disease association. Functional assay 
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Figure 3. Functional analysis of missense variants in BRCA1 C-terminai region. Transcriptional activity of BRCAl variants were evaluated in 
HEK293T cells using a GAL4-responsive firefly luciferase reporter gene (shown above the graphs) at 37'C. Cells were harvested 24h after transfections 
and the lysate was used to assess transcriptional activation ability by luciferase activity measurement. Activity is depicted as % of the wild-type 
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have been used to determine whether specific amino acid changes 
lead to detectable hmctional impact in a number of biochemical 
and biological processes that have been associated with BRCAl 
([1 1] and references therein). 

In this study we focused on the transcriptional activation assays 
for BRCAl. The method is based on the ectopic expression of 
BRCAl C-terminal fragments fused to GAL4-DBD and the ability 
of the resulting chimeric protein to activate transcription of a 
reporter gene [27,28]. Interestingly, there is a strong correlation of 
transcriptional activation results and other biochemical activities 
assigned to BRCAl such as the specific recognition of phosphor- 
ylated peptides [17] indicating that the transcriptional assay 



functions as a monitor of the structural integrity of the BRCAl C- 
terminal region. Importantly, the TA displays a strong correlation 
with pathogenicity indicating that the assay is specific and sensitive 
for BRCAl missense variants in the C-terminus [17]. 

We analyzed 18 naturally occurring BRCAl VUS (Figure lA) 
using the TA assay [11,19]. We also tested two artificial missense 
variants targeting phosphorylation sites in BRCAl (Figure lA). 
Transcriptional activity data was assessed by VarCall, a recently 
developed computational tool to infer disease association from 
functional data [18]. 

M1411T, C1697F, G1748D, and A1752T displayed signifi- 
cantly decreased transcriptional activity compared to the wt 
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Figure 5. Estimated Variant Specific Effects (Bayesian statistical model, VarCall, graphical summary). Variants are depicted in order of 
amino acid from residues 1396 to 1863. The top panel shows secondary structures in the C-terminal region. Coiled-coil region and a-helixes are 
depicted as pink and blue boxes, respectively, p-sheets are shown as gray arrows. The shaded area on the graph corresponds to different structures 
of similar color. The linker region is indicated with green shading. Each variant's activity is represented by a boxplot summarizing the marginal 
posterior distribution of its random effect. A point estimate of the mixture model is plotted on the right margin. Its top component corresponds to 
variants with no functional impact, whereas its bottom component corresponds to variants with functional impact. The mean of the benign/ 
damaging component is plotted as a green/red dotted horizontal line. Yellow box represents wild-type reference, green and red boxes represent low 
(class 1 and 2) and high (class 4 and 5) risk variants respectively, classified according to the International Agency for Research on Cancer (lARC), purple 
boxes represent VUS previously analyzed by our group (used to feed VarClass algorithm), and blue boxes represent the VUS analyzed by the first time 
in this study. 
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reference and were, with the exception of M 1 4 1 1 T, determined to 
be hkely to be pathogenic by VarCall (Figure 5, Table SI). 
Different variants on the same positions (A752V, A1752P, and 
C1697R) lead to severe protein folding defects inferred by 
increased protease sensitivity and impaired transcriptional activity 
[17]. Taken together these data highlight the relevance of these 
amino acid residues (C1697 and A1752) for the integrity of the 
BRCAl tBRCT structure. Assessment of M141 IT by VarCall was 
inconclusive. This variant, first reported in a Swedish ovarian 
cancer patient with a family history of breast cancer and other 
malignancies [29], hes on the BRCAl coiled-coil domain and was 
reported to disrupt the interaction between BRCAl and PALB2 
[30]. 

All other natural variants tested (S1457A, S1524A, I1529V, 
S1542A, E1660V, L1679Q, E1731D, D1733Y, E1754V, 
E1765K, S1790Y, H1805L, D1813H and L1844P) displayed 
transcriptional activity comparable to the wt reference, as did the 
two artificial variants analyzed (SI 42 3 A and SI 466 A). These 
natural variants were determined to be unlikely to be pathogenic 
by VarClass (Figure 5, Table SI). Because unphosphorylatable 
(alanine) variants at phosphorylation sites did not have impact on 
transcriptional activity we conclude that phosphorylation of these 
residues is not required for transcription activation under normal 
conditions. They also provide a demonstration that serine to 
alanine substitutions in these residues do not induce dramatic 
structural changes to the tBRCT domains. Our data do not 
address the relevance of these variants following DNA damage, 
but it is clear that these sites are critical for DNA injury induced 
ATM phosphorylation of BRCAl, and the overall repair response 
[13,20,31,32]. 

Next, we probed the role of the salt bridge that stabilizes the 
tandem BRCT interaction between residues R1699, E1836, and 
D1840 [15]. The temperature-dependent effects of the R1699W 
variant on transcriptional activity were previously reported [21]. 
We performed site-directed mutagenesis generating a series of 



eight variants of this residue (Figure IB). Except for R1699L that 
behaved as reference in all tested conditions, and R1699K, 
discussed below, all other variants showed low TA values in all 
ranges of temperature tested (30°C to 37°C, Figure 4). Interest- 
ingly, the R1699L variant displayed impaired phosphopeptide 
binding activity [17] suggesting that this variant can be used to 
uncouple transcriptional activation from phosphopeptide binding. 
In addition to its role in the salt bridge, R1699L and R1699W, are 
predicted to abolish the interaction of the T1700 residue with 
PRKCD, PRKC, PRKCQ, PRKCZ, PRKCA, PRKCG and 
MST2 [33,34]. Interestingly, R1699K was found to have a 
temperature-dependent behavior, showing about 80% of wild-type 
activity at 30°C and <40% activity at 37°C. These results were 
confirmed in the BRCAl -defident HCC1937 ceUs [25] (Figure 4). 

We also generated aspartic acid, glycine and lysine variants at 
position 1836 (Figure IB). While the conservative change E1836D 
showed no temperature-dependent behavior, the E1836G and 
E1836K have significantly lower transcriptional activity at 37°C. 
E1836K was also reported to have modest effects on protein 
folding and transcriptional activation but significant decrease in 
peptide binding and specificity [17] in a pattern reminiscent of the 
R1699L variant. Then, we combined variants for R1699K and 
R1699E, which had temperature dependent and independent 
behaviors, respectively (Figure IB and 4) with variants in El 836. 

As expected, for double variants of R1699K, the substitution of 
a glutamic acid by an aspartic acid (E1836D) did not change the 
pattern observed for the single R1699K. R1699K double variants 
containing a non-charged (E1836G) or a positively charged amino 
acid residue (E1836K) resulted in reduced transcriptional activities 
even at low temperatures (Figure 4). R1699E double variants 
showed the largest difference in activities from the permissive to 
the non-permissive temperature of all variants, suggesting that 
these double variants can be used as genetic tools to investigate the 
role of different biochemical processes mediated by the BRCT 
domains. 
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The study of temperature influence on BRCAl structure/ 
function is especially relevant because temperature-sensitive 
variants may have different clinical presentations or penetrances. 
The evaluated variants could, judged by the behavior of the 
R1699W allele be potential intermediate risk variants. [21]. 

Finally, combined with the natural variants in this study (Table 
SI) we examined 170 unique variants using VarCall and 
generated likelihoods of pathogenicity. Note that because this tool 
uses a Bayesian approach every variant, even the 82 variants 
assessed previously [18] are re-evaluated given the new data. This 
analysis (Figure .5) reveals interesting insights about the architec- 
ture of the C-terminus. Some variants in tlu; coilcd-cxiil region 
displayed functional impact albeit a moderate on(-. The disordered 
region, located N-terminally to the BRCT domains is tolerant to 
changes. Similarly, al helixes on both BRCTs seem also to be 
tolerant to changes. Otherwise, most secondary structures and the 
linker region tend to be sensitive to changes. A note of caution is 
warranted here as these results, obtained in a research environ- 
ment, are not meant to guide chnical decisions. Although the 
VarCall tool infers the likelihood of a variant being pathogenic, 
the results provided here are derived from a single data source 
(activity of a functional assay). Only determination of pathogenic- 
ity by a multifactorial likelihood model using independent data 
sources (e.g. segregation analysis, allele frequency, tumor pathology 
markers, co-occurrence, and co-observation with BRCA2 patho- 
genic variants) should be considered clinically [35,36,37]. 
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